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NOVEL PEPTIDE NUCLEIC ACIDS 

RELATED APPLICATION 

This patent application is a continuation-in-part 
of application PCT EP92/01219, filed May 19, 1992. The 
5 entire contents of this application, which published November 
26, 1992 as WO 92/20702, is incorporated herein by reference. 

FIELD OF THE INVENTION 

This invention is directed to compounds that are 
not polynucleotides yet which bind to complementary DNA and 
10 RNA strands. In particular, the invention concerns compounds 
wherein naturally-occurring nucleobases or other nucleobase- 
binding moieties are covalently bound to a polyamide 
backbone . 

BACKGROUND OF THE INVENTION 

15 Oligodeoxyribonucleotides as long as 100 base pairs 

(bp) are routinely synthesized by solid phase methods using 
commercially available, fully automatic synthesis machines. 
The chemical synthesis of oligoribonucieotides , however, is 
far less routine. Oligoribonucieotides also are much less 

2 0 stable than oligodeoxyribonucleotides, a fact which has 
contributed to the more prevalent use of oligodeoxyribo- 
nucleotides in medical and biological research directed zo, 
for example, gene therapy or the regulation of transcription 
or translation. 

25 The function of a gene starts by transcription of 

its information to a messenger RNA (mRNA) which, by 
interaction with the ribosomal complex, directs the synthesis 
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cf a protein coded for by its sequence. The synthetic 
process is known as translation. Translation requires the 
presence of various co-factors and building blocks, the amino 
acids, and their transfer RNAs (tRNA) , all of which are 
5 present in normal cells. 

Transcription initiation requires specific 
recognition of a promoter DNA sequence by the RNA- 
synthesizing enzyme, RNA polymerase. In many cases in 
prokaryotic cells, and probably in all cases in eukaryotic 
10 cells, this recognition is preceded by sequence-specific 
binding of a protein transcription factor to the promoter. 
Other proteins which bind to the promoter, but whose binding 
prohibits action of RNA polymerase, are known as repressors. 
Thus, gene activation typically is regulated positively by 
15 transcription factors and negatively by repressors. 

Most conventional drugs function by interaction 
with and modulation of one or more targeted endogenous 
proteins, e.g., enzymes. Such drugs, however, typically are 
not specific for targeted proteins but interact with other 
20 proteins as well. Thus, a relatively large dose of drug must 
be used to effectively modulate a targeted protein. Typical 
daily doses of drugs are from iO^-lO" 1 millimoles per 
kilogram of body weight or 10' 3 -10 millimoles for a 100 
kilogram person. If this modulation instead could be 
25 effected by interaction with and inactivation of mRNA, a 

dramatic reduction in the necessary amount of drug necessary 
could likely be achieved, along with a corresponding 
reduction in side effects. Further reductions could be 
effected if such interaction could be rendered site- 
30 specific. Given that a functioning gene continually produces 
mRNA, it would thus be even more advantageous if gene tran- 
scription could be arrested in its entirety. 

Oligodeoxynucleotides offer such opportunities. 
For example, synthetic oligodeoxynucleotides could be used as 
3 5 antisense probes to block and eventually lead to the 
breakdown of mRNA. Thus, synthetic DNA could suppress 
translation in vivo. It also may be possible to modulate the 
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genome of an animal by, for example, triple helix formation 
using oligonucleotides or other DNA recognizing agents. 
However, there are a number of drawbacks associated with 
triple helix formation. For example, it can only be used for 
5 homopurine sequences and it requires unphysiologically high 
ionic strength and low pH. 

Furthermore, unmodified oligonucleotides are 
unpractical both in the antisense approach and in the triple 
helix approach because they have short in vivo half- lives, 

10 they are difficult to prepare in more than milligram 

quantities and, thus, are prohibitively costly, and they are 
poor cell membrane penetrators. 

These problems have resulted in an extensive search 
for improvements and alternatives. For example, the problems 

15 arising in connection with double -stranded DNA (dsDNA) 
recognition through triple helix formation have been 
diminished by a clever "switch back" chemical linking whereby 
a sequence of polypurine on one strand is recognized, and by 
"switching back", a homopurine sequence on the other strand 

20 can be recognized. See, e.g., McCurdy, Moulds, and Froehler, 
Nucleosides, in press. Also, good helix formation has been 
obtained by using artificial bases, thereby improving binding 
conditions with regard to ionic strength and pH. 

In order to improve half life as well as membrane 

25 penetration, a large number of variations in polynucleotide 
backbones has been undertaken, although so far not with the 
desired results. These variations include the use of 
methylphosphonates , monothiophosphates , dithiophosphates , 
phosphoramidates, phosphate esters, bridged 

3 0 phosphoroamidates, bridged phosphorothioates , bridged 

methylenephosphonates , dephospho internucleotide analogs with 
siloxane bridges, carbonate bridges, carboxymethyl ester 
bridges, acetamide bridges, carbamate bridges, thioether, 
sulfoxy, sulfono bridges, various "plastic" DNAs , a-anomeric 

3 5 bridges, and borane derivatives. 

The great majority of these backbone modifications 
led to decreased stability for hybrids formed between the 
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modified oligonucleotide and its complementary native 
oligonucleotide, as assayed by measuring T m values. 
Consequently, it is generally understood in the art that 
backbone modifications destabilize such hybrids, i.e., result 
5 in lower T m values, and should be kept to a minimum. 



OBJECTS OP THE INVENTION 

It is one object of the present invention to 
provide compounds that bind ssDNA and RNA strands to form 
stable hybrids therewith. 
10 It is a further object of the invention to provide 

compounds that bind ssDNA and RNA strands . 

It is another object to provide compounds wherein 
naturally-occurring nucleobases or other nucleobase-binding 
moieties are covalently bound to a peptide backbone. 
15 It is yet another object to provide compounds other 

than RNA that can bind one strand of a double- stranded 
polynucleotide, thereby displacing the other strand. 

It is still another object to provide therapeutic, 
diagnostic, and prophylactic methods that employ such 
2 0 compounds. 

SUMMARY OF THE INVENTION 

The present invention provides a novel class of 
compounds, known as peptide nucleic acids (PNAs) , that bind 
complementary ssDNA and RNA strands. The compounds of the 

25 invention generally comprise ligands linked to a peptide 
backbone. Representative ligands include either the four 
main naturally occurring DNA bases (i.e., thymine, cytosine, 
adenine or guanine) or other naturally occurring nucleobases 
(e.g., inosine, uracil, 5-methylcytosine or thiouracil) or 

30 artificial bases (e.g., bromothymine, azaadenines or 

azaguanines, etc.) attached to a peptide backbone through a 
suitable linker. . 

In WO 92/20702, we described PNAs wherein such 
ligands are linked to a polyamide backbone solely through aza 

35 nitrogen atoms. The PNAs of the invention differ from those 



SNSDOCID: <WO 9<»2Sd77A2 I > 



WO 94/25477 PCT/IB94/00142 

- 5 - 

disclosed in WO 92/20702 principally in that their 
recognition moieties are linked to the poiyamide backbone 
additionally through amido and/cr ureido tethers. 

In certain preferred embodiments, the peptide 
5 nucleic acids of the invention have the general formula (I) : 



L 1 


L 2 


L n 


. ii 


i 2 


A n 




•"C 2 X D 2 


I 

---. c n v D n 


(i) 



wherein: 

n is at least 2, 

each of Lr-L n is independently selected from the 

10 group consisting of hydrogen, hydroxy, (C x -C 4 ) alkanoyl , 
naturally occurring nucleobases, non-naturally occurring 
nucleobases, aromatic moieties, DNA intercalators, 
nucleobase-binding groups, heterocyclic moieties, and 
reporter ligands, at least one of L x -L n being a naturally 

15 occurring nucleobase, a non-naturally occurring nucieobase, a 
DNA intercalator, or a nucleobase-binding group; 

each of C 1 -^ is (CR 6 R 7 ) y where R 6 is hydrogen and R 7 
is selected from the group consisting of the side chains of 
naturally occurring alpha amino acids, or R 6 and R 7 are 

20 independently selected from the group consisting of hydrogen, 
(C 2 -C 6 ) alkyl, aryl, aralkyl , heteroaryl, hydroxy, (C^- 
C 6 )alkoxy, (C^-Cg) alkyl thio, NR 3 R 4 and SR 5 , where R 3 and R 4 are 
as defined above, and R 5 is hydrogen, (C x -C s ) alkyl , hydroxy-, 
alkoxy-, or alkylthio- substituted (C-Cg) alkyl , or R 6 and R 7 

25 taken together complete an alicyclic or heterocyclic system; 

each of D x -D n is (CR 6 R 7 ), where R 6 and R 7 are as 
defined above; 

each of y and z is zero or an integer from 1 to 10, 
the sum y + z being greater than 2 but not more than 10; 

30 each of G l -G n ' 1 is -NR 3 C0- , -NR 3 CS- , -NR 3 S0- or - 

NR 3 S0 2 -, in either orientation, where R 3 is as defined above; 



SNSDOCID: <WO 9425477 A2J_> 



WO 94/25477 



PCT/IB94/00142 



- 6 - 

each pair of A : -A n and B 1 -B n are selected such that: 

(a) A is a group of formula (Ila) , (lib) or (lie) 
and B is N or R 3 N + ; or 

(b) A is a group of formula (I Id) and B is CH; 
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where : 

X is O, S, Se, NR 3 , CH 2 or C(CH 3 ) 2 ; 
Y is a single bond, 0, S or NR 4 ; 
10 each of p and q is zero or an integer from 1 

to 5, the sum p+q being not more than 10; 

each of r and s is zero or an integer from 1 
to 5, the sum r+s being not more than 10; 

each R 1 and R 2 is independently selected from 
15 the group consisting of hydrogen, (Ci-CJalkyl which 

may be hydroxy- or alkoxy- or alkylthio- 
substituted, hydroxy, alkoxy, alkylthio, amino and 
halogen; 

each of G 1 -G n ~ 1 is -NR 3 CO- , -NR 3 CS-, -NR 3 SO- or - 
2 0 NR 3 S0 2 -, in either orientation, where R 3 is as defined above; 

Q is -C0 2 H, -CONR'R'', -S0 3 H or -S0 2 NR'R'' or an 
activated derivative of -C0 2 H or -S0 5 H; and 

I is -NKR' ' 'R' ' ' ' or -NR' ' ' C (O) R' ' ' ' , where R' , R" , 
R' ' ' and R' ' ' ' are independently selected from the group 
25 consisting of hydrogen, alkyl , amino protecting groups, 
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reporter ligands, intercalators , chelators, peptides, 
proteins, carbohydrates, lipids, steroids, oligonucleotides 
and soluble and non-soluble polymers. 

In certain embodiments, at least one A is a group 
5 of formula (lie) and B is N or R 3 N*. In other embodiments, A 
is a group of formula (Ha) or (lib) , B is N or R 3 N\ and at 
least one of y or z is not 1 or 2 . 

Preferred peptide nucleic acids have general 
formula (Ilia) or (Illb) : 




10 



(Ilia) 




(Illb) 

wherein : 

each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
15 naturally occurring nucleobases, and non-naturally occurring 
nucleobases ; 
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each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 

n is an integer from 1 to 60; 
5 each of k, 1, and m is independently zero or an 

integer from 1 to 5 ; 

p is zero or 1; 

R h is OH, NH 2 or -NHLysNH 2 ; and 

R 1 is H or COCH 3 . 

10 Particularly preferred are compounds having formula (Ilia) or 
(Illb) wherein each L is independently selected from the 
group consisting of the nucleobases thymine (T) , adenine (A) , 
cytosine (C) , guanine (G) and uracil (U) , k and m are zero or 
1, and n is an integer from 1 to 30, in particular from 4 to 

15 20 . 

The peptide nucleic acids of the invention are 
synthesized by adaptation of standard peptide synthesis 
procedures, either in solution or on a solid phase. The 
synthons used are specially monomer amino acids or their 

2 0 activated derivatives, protected by standard protecting 

groups. The oligonucleotide analogs also can be synthesized 
by using the corresponding diacids and diamines. 

Thus, the novel monomer synthons according to the 
invention are selected from the group consisting of amino 

25. acids, diacids and diamines having general formulae: 
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(IV) 
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(VI) 





wherein L, A, E, C and D are as defined above, except that 
any amino groups therein may be protected by amino protecting 
groups; E is COOH, CSOH, SOOH, S0 2 OH or an activated 
3 0 derivative thereof; and F is NHR 3 or NPgR 3 , where R 3 is as 
defined above and Pg is an amino protecting group. 
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Preferred monomer synthons according to the 
invention have formula (Villa) - (VI lie) : 



(Villa) 





(VHIb) 




(VTIlc) 
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or amine -protect ed and/or acid terminal activated derivatives 
thereof, wherein L is selected from the group consisting of 
hydrogen, phenyl, heterocyclic moieties, naturally occurring 
nucleobases, and non-naturally occurring nucleobases; and R 7 ' 
5 is selected from the group consisting of hydrogen and the 
side chains of naturally occurring alpha amino acids. 

Unexpectedly, these compounds also are able to 
recognize duplex DNA by displacing one strand, thereby 
presumably generating a double helix with the other one. 

10 Such recognition can take place to dsDNA sequences 5-60 base 
pairs long. Sequences between 10 and 2 0 bases are of 
interest since this is the range within which unique DNA 
sequences of prokaryotes and eukaryotes are found. Reagents 
which recognize 17-18 bases are of particular interest since 

15 this is the length of unique sequences in the human genome. 
The compounds of the invention are able to form triple 
helices with dsDNA and double helices with RNA or ssDNA. The 
compounds of the invention also are able to form triple 
helices wherein a first PNA strand binds with RNA or ssDNA 

2 0 and a second PNA strand binds with the resulting double helix 

or with the first PNA strand. 

Whereas the improved binding of the compounds of the 
invention should render them efficient as antisense agents, 
it is expected that an extended range of related reagents may 
25 cause strand displacement, now that this surprising and 
unexpected new behavior of dsDNA has been discovered. 

Thus, in one aspect, the present invention provides 
methods for inhibiting the expression of particular genes in 
the cells of an organism, comprising administering to said 

3 0 organism a reagent as defined above which binds specifically 

to sequences of said genes . 

Further, the invention provides methods for 
inhibiting transcription and/or replication of particular 
genes or for inducing degradation of particular" regions of 
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double stranded DNA in cells of an organism by administering 
to said organism a reagent as defined above. 

Still further, the invention provides methods for 
killing cells or virus by contacting said cells or virus with 
5 a reagent as defined above which binds specifically to 
sequences of the genome of said cells or virus. 

DETAILED DESCRIPTION OF THE INVENTION 

In the oligonucleotide analogs and monomer synthons 
according to the invention, ligand L is primarily a naturally 

10 occurring nucleobase attached at the position found in 

nature, i.e., position 9 for adenine or guanine, and position 
1 for thymine or cytosine . Alternatively, L may be a non- 
naturally occurring nucleobase (nucleobase analog) , another 
base-binding moiety, an aromatic moiety, (C^-C^) alkanoyl , 

15 hydroxy or even hydrogen. It will be understood that the 
term nucleobase includes nucleobases bearing removable 
protecting groups. Some typical nucleobase ligands and 
illustrative synthetic ligands are shown in Figure 2 of WO 
92/20702. Furthermore, L can be a DNA intercalator , a 

20 reporter ligand such as, for example, a fluorophor, radio 
label, spin label, hapten, or a protein-recognizing ligand 
such as biotin. In monomer synthons, L may be blocked with 
protecting groups, as illustrated in Figure 4 of WO 92/20702. 
Linker A can be a wide variety of groups such as 

25 -CRH^CO-, -CR^CS-, -CR l R 2 CSe- # -CR 1 R 2 CNHR 3 - , -CR 1 R 2 C=CH 2 - and 
-CR l R 2 C-C(CH 3 ) 2 -, where R 1 , R 2 and R 3 are as defined above. 
Preferably, A is methylenecarbonyl (-CH 2 CO-) , amido (-C0NR 3 - 
) , or ureido ( -NR 3 CONR 3 - ) . Also, A can be a longer chain 
moiety such as propanoyl, butanoyl or pentanoyl, or 

30 corresponding derivative, wherein O is replaced by another 
value of X or the chain is substituted with R X R 2 or is 
heterogenous, containing Y. Further, A can be a (C 2 - 
C 6 )alkylene chain, a (C 2 -C 6 ) alkylene chain substituted with 
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R 2 R 2 or can be heterogenous, containing Y. In certain cases, 
A can just be a single bond. 

In one preferred form of the invention, B is a 
nitrogen atom, thereby presenting the possibility of an 
5 achiral backbone. B can also be R 3 N + , where R 3 is as defined 
above, or CH. 

In the preferred form of the invention, C is -CR 6 R 7 - ■ 
, but can also be a two carbon unit, i.e. -CHR 6 CHR 7 - or 
-CR 6 R 7 CH 2 -, where R 6 and R 7 are as defined above. R 6 and R 7 

10 also can be a heteroaryl group such as, for example, 

pyrrolyl, furyl, thienyl, imidazolyl, pyridyl, pyrimidinyl, 
indolyl, or can be taken together to complete an alicyclic 
system such as, for example, 1 , 2 -cyclobutanediyl , 1,2- 
cyclopentanediyl or 1, 2-cyclohexanediyl . 

15 In the preferred form of the invention, E in the 

monomer synthon is COOH or an activated derivative thereof, 
and G in the oligomer is -CONR 3 -. As defined above, E may 
also be CSOH, SOOH, S0 2 OH or an activated derivative thereof, 
whereby G in the oligomer becomes -CSNR 3 -, -SO£TR 3 -and -S0 2 NR 3 - 

2 0 , respectively. The activation may, for example, be achieved 

using an acid anhydride or an active ester derivative, 
wherein hydrogen in the groups represented by E is replaced 
by a leaving group suited for generating the growing 
backbone . 

25 The amino acids which form the backbone may be 

identical or different. We have found that those based on 2- 
aminoethylglycine are especially well suited to the purpose 
of the invention. 

In some cases it may be of interest to attach 

3 0 ligands at either terminus (Q, I) to modulate the binding 

characteristics of the PNAs . Representative ligands include 
DNA intercalators which will improve dsDNA binding or basic 
groups, such as lysine or polylysine, which will strengthen 
the binding of PNA due to electrostatic interaction. To 
3 5 decrease negatively charged groups such as carboxy and sulfo 
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groups could be used. The design of the synthons further 
allows such other moieties to be located on non-terminal 
positions . 

In a further aspect of the invention, the PNA 
5 oligomers are conjugated to low molecular effector ligands 
such as ligands having nuclease activity or alkylating 
activity or reporter ligands (fluorescent, spin labels, 
radioactive, protein recognition ligands, for example, biotin 
or haptens) . In a further aspect of the invention, the PNAs 

10 are conjugated to peptides or proteins, where the peptides 
have signaling activity and the proteins are, for example, 
enzymes, transcription factors or antibodies. Also, the PNAs 
can be attached to water-soluble or water- insoluble polymers. 
In another aspect of the invention, the PNAs are conjugated 

.15 to oligonucleotides or carbohydrates. When warranted, a PNA 
oligomer can be synthesized onto some moiety (e.g., a peptide 
chain, reporter, intercalator or other type of ligand- 
containing group) attached to a solid support. 

Such conjugates can be used for gene modulation 

20 (e.g., gene targeted drugs), for diagnostics, for 
biotechnology, and for scientific purposes. 

As a further aspect of the invention, PNAs can be 
used to target RNA and ssDNA to produce both ant isense -type 
gene regulating moieties and hybridization probes for the 

25 identification and purification of nucleic acids. 

Furthermore, the PNAs can be modified in such a way that they 
can form triple helices with dsDNA. Reagents that bind 
sequence-specif ically to dsDNA have applications as gene 
targeted drugs. These are foreseen as extremely useful drugs 

3 0 for treating diseases like cancer, AIDS and other virus 
infections, and may also prove effective for treatment of 
some genetic diseases. Furthermore, these reagents may be 
used for research and in diagnostics for detection and 
isolation of specific nucleic acids. 
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The triple helix principle is believed to be the 
only known principle in the art for sequence- specif ic 
recognition of dsDNA. However, triple helix formation is 
largely limited to recognition of homopurine-homopyrimidine 
5 sequences . Strand displacement is superior to triple helix 
recognition in that it allows for recognition of any sequence 
by use of the four natural bases. Also, in strand 
displacement recognition readily occurs at physiological 
conditions, that is, neutral pH, ambient (20-40 C) 

10 temperature and medium (100-150 mM) ionic strength. 

Gene targeted drugs are designed with a nucleobase 
sequence (containing 10-20 units) complementary to the 
regulatory region (the promoter) of the target gene. 
Therefore, upon administration of the drug, it binds to the 

15 promoter and block access thereto by RNA polymerase. 

Consequently, no mRNA, and thus no gene product (protein) , is 
produced. If the target is within a vital gene for a virus, 
no viable virus particles will be produced. Alternatively, 
the target could be downstream from the promoter, causing the 

2 0 RNA polymerase to terminate at this position, thus forming a 
truncated mRNA/protein which is nonfunctional. 

Sequence- specif ic recognition of ssDNA by base 
complementary hybridization can likewise be exploited to 
target specific genes and viruses. In this case, the target 

25 sequence is contained in the mRNA such that binding of the 
drug to the target hinders the action of ribosomes and, 
consequently, translation of the mRNA into protein. The 
peptide nucleic acids of the invention are superior to prior 
reagents in that they have significantly higher affinity for 

30 complementary ssDNA. Also, they possess no charge and water 
soluble, which should facilitate cellular uptake, and they 
contain amides of non-biological amino acids, which should 
make them biostable and resistant to enzymatic degradation 
by, for example, proteases. 
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It is believed that PNA oligomers according to the 
invention exhibit biochemical/biological properties similar 
to those disclosed in WO 92/20702, and that such properties 
can be determined by similar means. It also is believed that 
5 the PNAs of the invention can be synthesized by similar 

methodology. Monomer synthons according to the invention are 
coupled using the standard protocols to give the desired 
oligomeric sequences. 

One monomer synthon according to the invention is 

10 prepared by reacting glycinamide hydrochloride 1 with ethyl 
acrylate in the presence of an acid scavenging base to give 
the Michael adduct, N-carboxamidomethyl-£- alanine ethyl ester 
2. The adduct 2 is condensed with 1-carboxymethyl thymine 3 
using diisopropylcarbodiimide and hydroxybenzotriazole to 

15 give (N-carboxamidomethyl ) -N- (1- (thymin-l-yl) acetyl) 

alanine ethyl ester 4. The primary amide of 4 is oxidized 
and rearranged to the Boc -protected amine with sodium 
hypobromite in t-butanol to provide (N-t- 

butyloxycarbonylaminomethyl) -N- (1- (thymin-l-yl) acetyl) -0- 

2 0 alanine ethyl ester 5. The ethyl ester is hydrolyzed with 

aqueous base to provide the thymine-based monomer, (N-t- 
butyloxycarbonylaminomethyl) -N- (1- (thymin-l-yl) acetyl) -0- 
alanine 6. This reaction sequence is followed to prepare the 
corresponding C, G, and A -based monomers, namely, N- (t- 
25 butyloxycarbonylaminomethyl) -N- (1- (N 4 -benzyloxycarbonyl- 
cytosin-l-yl) acetyl) -^-alanine, N- (t-butyloxycarbonyl- 
aminomethyl-N- (1- (2-amino-6-benzyloxy-purin-9-yl) acetyl) -(3- 
alanine, N- (t -butyloxycarbonylaminomethyl) -N- (1- (N 6 - 
benzyloxycarbonyl - adenine - 9 -yl ) acetyl ) - /3 - alanine . 

3 0 A further monomer synthon is prepared by reacting 

1-aminothymine with triphosgene to give the carbamoyl 
chloride derivative, j3, which is condensed with N- (2-t- 
butyloxycarbonylaminoethyl) glycine ethyl ester and an acid 
scavenger to yield the fully protected monomer, 9. The ester 
3 5 is hydrolyzed to give the useful monomer, 1£. This reaction 
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sequence is followed to prepare the corresponding C, G, and A 
-based monomers, namely, N- (t-butyloxycarbonylaminoethyl) -N- 
( 1 -N 4 -benzyloxycarbonyl - cytos in- 1 -yl ) aminocarbonyl ) - glyc ine , 
N- (t-butyloxycarbonylaminoethyl) -N- (1- ( 2 -amino- 6 -benzyl oxy- 
5 purin-9-yl) - aminocarbonyl) -glycine, N- (t -butyl oxycarbonyl- 
amino ethyl) -N- (1- ( N s -benzyl oxycarbonyl- adenine- 9- 
yl) aminocarbonyl) -glycine. 

A further monomer synthon is prepared by converting 
2-hydroxy-5- (t -butyl oxycarbonylamino ) pentanoic acid ethyl 
10 ester to its azido analog via the use of diphenyl phosphoryl 
azide, DEAD, and triphenylphosphine generally by the 
procedure described in Tetrahedron Letters, (1977) , p. 1977 . 
The azido compound, 12., was converted to the 

iminophosphorane, 13_, and used immediately in a high pressure 
15 reaction with carbon dioxide to convert it into isocyanate , 
14. The isocyanate is condensed with thymine to give the 
fully protected monomer, 15., which is hydrolyzed to the 
actual monomer, .16., using hydroxide. This reaction sequence 
is followed to prepare the corresponding C, G, and A -based 

2 0 monomers, namely, 5- ( t-butyloxycarbonylamino) -2- ( (N 4 - 

benzyloxycarbonyl-cytosin-l-yl) carbonylamino-pentanoic acid 
ethyl ester, 5- ( t -butyloxycarbonylamino-2- ( (2-amino-6- 
benzyloxy-purin-9-yl) carbonyl amino) -pentanoic acid ethyl 
ester, 5- (t-butyloxycarbonylamino) -2- ( (N 6 -benzyloxycarbonyl- 
25 adenine - 9 -yl ) carbonylamino) -pentanoic acid ethyl ester. 

Additional objects, advantages, and novel features 
of this invention will become apparent to those skilled in 
the art upon examination of the following examples thereof, 
which are not intended to be limiting. 

3 0 Example 1 

N- car boxamidome thy 1-/3- alanine ethyl ester, 2. 

Glycinamide hydrochloride (1, 11.0 g, 0.10 mcl) is 
suspended in 500 mL of dioxane and diisopropylethylamine 
(12.9 g, 0.10 mol) is added and the. mixture cooled to 0°C. 
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With stirring ethyl acrylate (10.0 g, 0.10 mol) is added 
dropwise over 15 minutes. After the addition is complete the 
reaction is allowed to warm to room temperature and stir for 
12 hours. The reaction mixture is diluted with water 1.5 L 
5 and the pH adjusted to 4. The solution is extracted with 
diethyl ether (3X3 00 mL) . The aqueous layer is neutralized 
with sodium hydroxide and extracted 5 times with 
dichloromethane . The dichloromethane extracts are combined, 
dried (Na 2 S0 4 ) , and the solvent removed to give a solid. 

10 Example 2 

(N-carboxamidomethyl) -N- (1- ( thymin-l-yl) acetyl) -/S- alanine 
ethyl ester, 4. 

The product from Example 1, 2, is dissolved in 
dichloromethane (500 mL) and to this is added 1-carboxymethyl 

15 thymine (3, 15.5 g, 0.1 mol), hydroxybenzotriazole (13.5 g, 
0.1 mol) and the solution is cooled to 0°C in an ice bath. 
Diisopropylcarbodiimide (12.6 g, 0.1 mol) dissolved in 50 mL 
of dichloromethane is added in one portion and the reaction 
is stirred for 12 hours. The suspended solids are removed by 

20 filtration and washed with dichloromethane. The solution is 
evaporated to a solid and the desired product, 4, is obtained 
after chromatography on silica gel using 
dichloromethane/ethanol as eluent. 

Example 3 

25 (N-t-butyloxycarbonylaminomethyl) -N- (1- ( thymin- 1-yl) acetyl) - 
alanine ethyl ester, 5. 

The product from Example 2,4, is dissolved t- 
butanol/dioxane (4:1, 500 mL) , cooled to 0°C, and sodium 
hypobromite solution (0.15 mol) is added. After 6 hours the 
3 0 reaction mixture is evaporated to remove volatile solvents 
and the residue is diluted with water (500 mL) and extracted 
(5X200 mL) with dichloromethane. The extracts are combined, 
dried, and evaporated to a solid. 



BNSDOC1D: <WO 9-425477 A2 I ^> 



WO 94/25477 



PCT/IB94/00142 



- 18 - 

Example 4 

(N-t-butyloxycarbonylaminomethyl) -N- (1- (thymin-l-yl) acetyl) - 

0- alanine, 6. 

The product from Example 3, 5, is dissolved in 
5 ethanol (500 mL) and 5M sodium hydroxide (2 0 mL) is added. 
The solution is stirred for 6 hours, then neutralized with 5N 
hydrochloric acid (20 mL) and the solution evaporated to a 
solid. This solid is recrystallized to give the title 
compound. 

10 Example 5 

1- (Chlorocarbonylamino) - thymine, J3. 

1 -Amino thymine (7, 12.5 g, 0.1 mol) is dissolved 
in tetrahydrofuran (500 mL) and the solution is cooled to 0°C 
and a 2M solution of triphosgene in THF (150 mL) is added and 
15 the reaction is stirred for 4 hours. The solution is 
evaporated to a solid, which is used as is in the next 
reaction. 

Example 6 

N- (2-t-Butyloxycarbonylaminoethyl) -N- (thymin-l-yl- amino - 
2 0 carbonyl) glycine ethyl ester, 9. 

The product from Example 5, 8., is dissolved in THF 
(500 mL) and dii s op ropyl ethyl amine (12.9 g, 0.1 mol) is 
added, followed by N- (2-t-butyloxycarbonylaminoethyl) glycine 
ethyl ester (24.6 g, 0.1 mol) and the solution stirred for 12 
25 hours. The reaction is diluted with 1000 mL of diethyl ether 
and extracted 3 times with 0 . IN HC1 solution. The organic 
layer is washed with diluted sodium bicarbonate solution, 
dried, filtered and evaporated to give a solid. 

Example 7 

30 N- (2 - 1 -Butyl oxy carbonyl amino ethyl) -N- ( thymin- 1-yl- amino - 
carbonyl) glycine, 10, 

The product from Example 6, £, is dissolved in 
ethanol (500 mL) and 2M sodium hydroxide (50 mL) is added. 
The reaction is stirred for 6 hours, then neutralized with 50 
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mL. of 2M KC1 solution, and evaporated to remove the ethanol. 
The residue is dissolved in dichloromethane (250 mL) and is 
extracted with water (2X50 mL) , dried, filtered, and 
evaporated to a solid. 

5 Example 8 

2-Azido-5- (t-butyloxycarbonylamino) pentanoic acid ethyl 
ester, 12. 

2-Hydroxy-5- ( t-butyloxycarbonylamino) pentanoic acid 
ethyl ester, ( 11 , 26.1 g, 0.1 mol) , triphenylphosphine (26.2 

10 g, 0.10 mol), diethylazodicarboxylate (17.4 g, 0.1 mol), and 
diphenylphosphorylazide (27.5 g, 0.1 mol) is dissolved in THF 
(500 mL) and heated to reflux and maintained there for 8 
hours. The reaction is cooled to room temperature, 
evaporated to an oil, and the product isolated by column 

15 chromatography using dichloromethane : ethanol as eluent . 

Example 9 

2 - Iminotriphenylphosphoranyl - 5 - ( t- 

butyloxycarbonyl amino) pentanoic acid ethyl ester, 12. • 

The product, 12, from Example 8 is dissolved in THF 
20 and triphenylphosphine (26.2 g, 0.1 mol) is added and the 

reaction is stirred for 4 hours. This solution is used as is 
for the next reaction (Example 10) . 

Example 10 

2 -Isocyanato-5- (t-butyloxycarbonylamino) pentanoic acid ethyl 
25 ester, 14. 

The reaction solution from Example 9 is placed in a 
Parr® bomb and carbon dioxide (22 g, 0.5 mol) is condensed 
into the bomb. The bomb is sealed and heated to 50°C for 12 
hours. The bomb is cooled and vented to atmospheric 
30 pressure. The solution is transferred from the bomb to a 
flask and used as is in the next reaction (Example 11) . 
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Example 11 

2- (Thymin-l-ylcarbonylamino) -5- (t- 

butyloxycarbonylamino)pentanoic acid ethyl ester, 15. 

The reaction solution from Example 10 is placed in 
5 a flask and to this is added thymine (12.6 g, 0.1 mol) . The 
resulting solution is allowed to stir for 12 hours, then is 
evaporated to a solid, which is purified by column 
chromatography using dichlormethane : ethanol as the eluent. 

Example 12 

10 2- (Thymin-l-ylcarbonylamino) -5- (t-butyloxycarbonyl- 
amino) pentanoic acid, 16 . 

The product from Example 11, 15, is dissolved in 
ethanol (500 mL) and to this added 2M sodium hydroxide (50 
mL) and the reaction stirred for 12 hours. The reaction is 
15 neutralized with 2M HC1 solution (50 mL) and evaporated to a 
small volume. This residue is diluted with water (250 mL) 
and extracted with dichloromethane (4X10 0 mL) , dried, 
filtered, and evaporated to give a solid. 

Example 13 

20 1- (2 (-Thyminyl) acetyl) -1- (2- (tBoc-aminopropyl) ) glycine, 17 

1, 3-Diaminopropane (0.05 mmol) was dissolved in THF 
(100 mL) and chloroacetic acid (0.045 mmol) was added and the 
reaction heated at reflux for 4 hours and cooled to room 
temperature. The solution was diluted with diethyl ether 

25 (500 mL) and extracted 3 times with IN NaOH solution. The 
combined water layers were acidified to pH = 4 and extracted 
with dichloromethane (5X5 0 mL) . The organic layers were 
combined, dried, filtered and evaporated to an oil. This oil 
was dissolved in methanol (1000 mL) and dry HC1 gas added. 

3 0 The reaction was heated to reflux and maintained there for 8 
hours. The reaction was cooled and evaporated to an oil. 
This oil was dissolved in dioxane/water and p-nitrophenyl - 1- 
butylcarbonate (0.05 mmol) was added and the pH adjusted to 
10. The reaction was stirred for 4 hours, then neutralized 

35 and extracted 5 times with dichloromethane. The methyl ester 
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was dissolved in 50% DMF in dichloromethane and to this was 
added dicyclohexylcarbodiimide (DCC, 0.05 mmol) and 
hydroxbenzotriazole (0.05 mmol), and 2 -thyminylacetic acid 
(0.05 mmol) . The reaction was stirred for 18 hours then the 
5 DCC was removed by filtration and the residue evaporated to 
an oil. The oil was purified by column chromatography. 

Example 14 

3- (Boc -amino) -1, 2 -propanediol, 18 

3 - Amino -1, 2 -propanediol (40.00 g, 0.440 mol, 1.0 

10 eqv) was dissolved in water (1000 ml) and cooled to 0 °C, and 
di-tert-butyl dicarbonate (115.0 g, 0.526 mol , 1.2 eqv) was 
added in one portion. The reaction mixture was heated to 
room temperature on a water bath with stirring. The pH was 
maintained at 10.5 with a solution of sodium hydroxide (17.56 

15 g, 0.440 mol, 1.0 eqv) in water (120 ml). When the addition 
of aqueous sodium hydroxide was completed, the reaction 
mixture was stirred overnight at room temperature. 
Subsequently, ethyl acetate (750 ml) was added to the 
reaction mixture followed by cooling to 0°C and the pH was 

20 adjusted to 2 . 5 with 4N sulfuric acid with vigorous stirring. 
The phases were separated. The water phase was washed with 
additional ethyl acetate (6x3 50 ml) . The volume of the 
organic phase was reduced to 900 ml by evaporation under 
reduced pressure and washed with a saturated aqueous solution 

25 of potassium hydrogen sulfate diluted to twice its volume 

(1x1000 ml) and with saturated aqueous sodium chloride (1x500 
ml) . The organic phase was dried (MgSOj and evaporated 
under reduced pressure to yield 50.12 9 (60%) of the title 
compound. The product could be solidified by evaporation 

3 0 from methylene chloride and subsequent freezing. X H-NMR 

(CDCI3/TMS) : 5 = 1.43 (s, 9H, Me 3 C) , 3.25 (m, 2H, CH2) , 3.57 
(m, 2H, CH 2 ) , 3.73 (m, 1H, CH) . 13 C-NMR (CDC1 3 /TMS): 5 = 28.2 
(Me 3 C) , 42.6 (CH 2 ) , 63.5, 71.1 (CH 2 0H, CHOH) , 79.5 (Me 3 C) , 
157.0 (C=0) . 
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Example 15 

Boc - aminoac e tal dehyde , 19 

3- (Boc- amino ) -1, 2 -propanediol (18, 20.76 g, 0.109 
mol, 1 eqv) was suspended in water (150 ml) . Potassium m- 
5 periodate (24.97 g, 0.109 moi, 1 eqv) was added and the 
reaction mixture was stirred for 2 h at rpom temperature 
under nitrogen. The reaction mixture was filtered and the 
water phase was extracted with chloroform (6x250 ml) . The 
organic phase was dried (MgS0 4 ) and evaporated to afford 

10 crude Boc -aminoacet aldehyde as a golden oil. This oil was 
kugelrohr distilled at 80°C and 0.2 mbar to yield 13.19 g 
(76%) of the title compound as a semicrystalline solid. ^-HNMR 
(DMSO-d 6 /TMS) : 5 = 1.47 (s, 9H, Me 3 C) , 3.81 (d, J=5 . 6 Hz, 2H, 
CH 2 ) , 7.22 (b, 1 H, NH) , 9.54 (s, 1 H, CHO) . 13 C-NMR (DMSO- 

15 d 6 /TMS) : 6 = 28.2 (Me 3 C) , 50.5 (CH 2 ) , 78.4 (Me 3 C) , 156.1 

(carbamate C=0) , 200.6 (CHO). Anal. Calcd. for C 7 H 13 N0 3 : C, 
52.82; H, 8.23; N, 8.80. Found: C, 52.21; H, 8.15; N, 8.46. 

Example 16 

(Boc -amino) ethyl glycine Methyl Ester, 20 

2 0 A. Reduction With Sodium Cyanoborohydride 

Boc -aminoacet aldehyde (19, 1.00 g, 6.3 mmol, 1 eqv) 
was dissolved in methanol (50 ml) . Anhydrous sodium acetate 
(1.03 g, 12.6 mmol, 2 eqv), glycine methyl ester 
hydrochloride (Aldrich Chemical Co., 0.79 g, 6.3 mmol, 1 eqv) 
25 and sodium cyanoborohydride (1.97 g, 31.4 mmol, 5 eqv) were 
added to the solution in that order. The reaction mixture 
was stirred for 2 h at room temperature under nitrogen. 
Water (50 ml) was added to the suspension and the resulting 
clear solution was evaporated under reduced pressure to 

3 0 remove the methanol. The aqueous phase was extracted with 

methylene chloride (3x100 ml) . The organic phase was washed 
with a saturated aqueous solution of sodium chloride (1x10 0 
ml) , dried (Na 2 S0 4 ) , filtered and then evaporated under 
reduced pressure affording 1.41 g of crude title compound as 
3 5 a yellow oil. The crude product was kugelrohr distilled at 
110°C and 0.5 mbar to yield 0.49 g (34%) of 2- (Boc- 
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amino) ethylglycine methyl ester as a colorless liquid. X H- 
NMR (CDCI3/TMS) : 6 =1.36 (s, 9H, Me 3 C) , 1.91 (s, 1H # NH) , 
2.67 (t, J=6 Hz, 2H, NHCH 2 ) , 3.13 (q, J=6 Hz, 2H, NHCH 2 ) , 
3.34 (s, 2H, CH 2 COO), 3.65 (s, 3H, OMe) , 5.13 (b, 1H, 
5 carbamate NH) . 13 C-NMR (CDCl 3 /TMS) : 5 = 28.2 (Me 3 C) , 39.9, 
48.5 (NHCH 2 ) , 50.0 (CH 2 C00) , 51.5 (OMe), 78.9 (Me 3 C) , 155.9 
(carbamate C=0) , 172.6 (ester 0=0) . Anal. Calcd for C 10 H 2 N 2 0 4 : 
C, 51.71; H, 8.68; N, 12.06. Found: C, 51.55; H, 8.72; N, 
11.79 . 

10 B. Catalytic hydrogenation 

Boc-aminoacet aldehyde (2.08 g, 13.1 mmol, 1 eqv) 
was dissolved in methanol (50 ml) and cooled to 0 °C. 
Palladium on activated carbon (10%, 0.4 g) was added under 
nitrogen and with vigorous stirring. Anhydrous sodium 

15 acetate (2.14 g, 26.1 mmol, 2 eqv) and glycine methyl ester, 
hydrochloride (1.64 g, 13.1 mmol, 1 eqv) each dissolved in 
methanol (25 ml) were added to the mixture. The reaction 
mixture was hydrogenated at atmospheric pressure and room 
temperature with vigorous stirring, until hydrogen uptake had 

20 ceased (when 287 ml, 13.1 mmol, 1 eqv had been consumed) 

after about 1 h. The reaction mixture was filtered and the 
solvent removed under reduced pressure. The residue was 
suspended in water (3 0 ml) , and during vigorous stirring pH 
was adjusted to 8 by dropwise addition of 0.5 N NaOH . The 

25 water phase was extracted with methylene chloride (4x5 0 ml) . 
The organic phase was dried (Na 2 S0 4 ) , filtered and evaporated 
under reduced pressure to yield 3.03 g of crude title 
compound as a golden oil. The crude product was kugelrohr 
distilled at 100 °C and 0.2 mbar to afford 2.33 g (77%) of 2- 

30 (Boc-amino) ethylglycine methyl ester as a colorless liquid. 
The analytical data were in accord with those provided above 
for the reduction with sodium cyanoborohydride . 
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Example 17 

General Method for the Synthesis of PNA Oligomers 

Oligomers were prepared generally in accordance 
with the methods disclosed by WO 92/20702, Benzyhydryl amine 
5 resin (initially loaded 0.28 mmol/gm with Boc-L-Lys (2- 

chlorobenyloxycarbonyl) ) was swollen in DMF and an excess of 
a monomer to be coupled was added, followed by dicyclohexyl- 
carbodiimide (0.15M in 50% DMF in dichloromethane) . The Boc 
deprotection was accomplished by trif luoroacetic acid 

10 treatment. The progress of the coupling reactions was 

monitored by quantitative ninhydrin analysis. The PNA was 
released from the resin using anhydrous HF under standard 
conditions. The products were purified using KPLC with 
acetonitrile-water (0.1%TFA) gradient and structure confirmed 

15 by fast atom bombardment mass spectrometry. The following 
sequences have been synthesized by this method: 

H-T 10 LysNH 2 

H-T 4 CT 5 LysNH 2 

H-T 2 CT 2 CT 4 LysNH 2 
20 H-T 4 CT 2 CT 2 LysNH 2 

H - TGTACGTCACAACTA - NH 2 

H-CCTTCCCTT-NH 2 

H-TTCCCTTCC-NH 2 

H - TAGTTAT CTCTATCT - NH 2 
25 H- TGTACGTCACAACTA -NH 2 

H-GCACAGCC-LYS-NH 2 

H - TTTTCTTTT - NH 2 

H - TTTTTTTTTCCCCCCC - NH 2 

H-CCCCCCCTTTTTTTTT-NH 2 
3 0 H - CCT CCTTCCC -NH 2 

H - TTCTCTCTCT - NK 2 

H - TTTTTCTCTCTCTCT - NH 2 

H-CCCCCACCACTTCCCCTCTC- (Lys) 9 NK 2 

H- CTTATATTCCGTCATCGCTCLys -NH 2 
3 5 H- CTGTCTCCATCCTCTTCACT - NH 2 

K-TATTCCGTCATCGCTCCTCALys-NH 2 
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H~CCCCCACCACTTCCCCTCTC-NH 2 
H- CTGCTGCCTCTGTCTCAGGTLysNH 2 
H-T 4 - (^-alanine) C-T 5 LysNK 2 
H-T 4 - (/3-alanine)T-T 5 LysNH 2 

5 Those skilled in the art will appreciate that 

numerous changes and modifications may be made to the 
preferred embodiments of the invention and that such changes 
and modifications may be made without departing from the 
spirit of the invention. It is therefore intended that the 
10 appended claims cover all such equivalent variations as fall 
within the 'true spirit and scope of the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Nielsen, Peter E . 

Buchardt, 01 e 
Egholm, Michael 
Berg, Rolf H. 

(ii) TITLE OF INVENTION: Novel Peptide Nucleic Acids 
(iii) NUMBER OF SEQUENCES: 24 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Woodcock, Washburn, Kurtz, Mackiewicz, and 

Norris 

(B) STREET: One Liberty Place - 46th Floor 

(C) CITY: Philadelphia 

(D) STATE: PA 

(E) COUNTRY: USA 

(F) ZIP: 19103 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 
(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Lucci, Joseph 

(B) REGISTRATION NUMBER: 33,307 

(C) REFERENCE / DOCKET NUMBER: ISIS1017 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 215-568-3100 

(B) TELEFAX: 215-568-3439 
(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= MODIFIED- SITE 

/note= "Thymine heterocycle is attached to 
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N- acetyl (2 -aminoethyl) glycine through the 
N-acetyl group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /lahel= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position l of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position l of the heterocycle . " 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /labels Modif ied- site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 
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(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl {2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 9 

<D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys 
15 10 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 
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(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
<B) LOCATION: 5 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at postion 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label* Modif ied-site 

/notes "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 9 
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(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoe thy 1) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION : SEQ ID N0:2: 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys 
15 10 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /labels Modif ied-site 



/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(ix) 



FEATURE : 



(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label* Modified- site 



/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(ix) 



FEATURE : 



(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: / label = Modif ied- site 



/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(ix) 



FEATURE : 



(A) NAME /KEY : Modif ied-site 



SNSDCCID: <WO 9425477A2 t > 



WO 94/25477 



PCT7IB94/00142 



- 31 - 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Thymine heterocycle is attached to 

N- acetyl (2- aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . 11 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label* Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label* Modified- site 

/note= "Thymine heterocycle is attached to 
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N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys 
15 10 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modified- site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 
<B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label* Modif ied-site 
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/note* "Cytosine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) . LOCATION: 8 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
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(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 

N- acetyl (2 -amino ethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -arainoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: / label = Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

■ (ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: / label = Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied- site 

(B) LOCATION: 6 

(B) OTHER INFORMATION: /label= Modif ied- site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
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croup at position 9 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 
(E) LOCATION: 7 

(D) OTHER' INFORMATION: /label* Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (-2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
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(B) LOCATION: 13 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label = Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl ( 2 - aminoethyl ) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
x 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 
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(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY; Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION : /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /labels Modif ied-site 

/notes "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied- site 

(B) LOCATION: 9 
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(D) OTHER INFORMATION: / label = Modified- site 

/note= "Thymine heterocycle is attached to 

N- acetyl {2 -amino ethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO : 7 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /labels Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: / label = Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 
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(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 

N- acetyl (2- aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(E) LOCATION: 6 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix) FEATURE : 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION : /label* Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 

N- acetyl ( 2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note* "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION : /label* Modif ied-site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acety(2-aminoethyl)glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note* "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 
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(A) NAME/KEY: Modif ied-s ite 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 

/ncte= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position l of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl} glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label= Modif ied-site 
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/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : S ingl e 

(D) TOPOLOGY: unknown 
{ix) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 3 
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(D) OTHER INFORMATION : /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note- "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group ac position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix ) FEATURE : 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label- Modif ied-site 



BNSDGC1D: <WG _9425*77A2 J > 



WO 94/25477 



PCT/IB94/00142 



- 44 - 

/note= "Adenine heterocycle is attached to 
N-acetyl {2 -aminoethyl) glycine through the N-acetyl 
group at position S of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: / label = Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl {2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION : 15 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
{ii} MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
(3) LOCATION: 5 

"(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 
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(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 
<B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys 

1 5 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label* Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
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group at position 1 of the heterocycle . » 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- sits 

(B) LOCATION: 4 

(D) OTHER INFORMATION: / label = Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position l of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION : /label- Modif ied-site 

/note= "Thymine 'heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
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(B) LOCATION: 9 

(D) OTHER INFORMATION: /label- Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 
(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
<ii) MOLECULE TYPE: peptide 
( ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /labei= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 
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(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION : /labels Modif ied-site 

/ncte= "Thymine heterocycle is attached to 
N- acetyl (2 -amino ethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

<D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /labels Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle. " 

(ix) FEATURE: 

(A5 NAME /KEY : Modif ied-site 
(B) LOCATION: II 

(D) OTHER INFORMATION: /label= Modif ied-site 
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/note= "Cytosine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

{ ix) FEATURE : 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modified- site 

/note= "Cytosine heterocycle is attached to 

N- acetyl (2 - amino ethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION : /label- Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position l of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION : /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
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group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- si te 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached tc 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix ) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 
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(B) LOCATION: 13 

(D) OTHER INFORMATION: /label = Modified- site 

/note= "Thymine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix ) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 
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(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(E) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position l of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix ) FEATURE : 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 8 
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(D) OTHER INFORMATION: /iabel= Modif xed-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label's Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle. 11 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

15 10 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
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(B) LOCATION: 3 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoe thy 1) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note- "Thymine heterocycle is attached to 
N-acetyl (2 -amino ethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group .at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modified- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Cytosine heterocycle is attached to 
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N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 



(ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label- Modif ied- site 



/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(ix) 



FEATURE : 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label- Modif ied-site 



/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(ix) 



FEATURE : 



(A) NAME /KEY : Modif ied-site 



(B) LOCATION: 3 



(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(ix) 



FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 4 

<D) OTHER INFORMATION: /label= Modif ied- site 
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/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label- Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl ( 2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 
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(ix) FEATURE : 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label** Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl ( 2 - aminoethyl ) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION : 15 

(D) OTHER INFORMATION: /label = Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acety( 2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
x 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 

( c ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION : /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 
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(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -amino ethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl {2 -amino ethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note- "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /labels Modif ied- site 

/note= "Cytosine heterocycle is attached to 



BNSDOCID: <WO _S*25«i77A2J^ 



PCT/IB94/00142 

WO 94/25477 

- 62 - 

N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . » 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 15 

<D) OTHER INFORMATION: /label* Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied- site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note- "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of thr heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied- site 

(E) LOCATION: 19 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 
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(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 0 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Cytosine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
x 5-10 15 

Xaa Xaa Xaa Xaa Lys Lys Lys Lys Lys Lys Lys Lys Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 1 

<D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME/KEY: Modified- site 
<B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
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N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 
(E) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label = Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label = Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label = Modif ied-site 

/note« "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label* Modif ied-site 

/notes "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 
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(A) NAME /KEY: Modif ied-site 
<B) LOCATION: 11 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Guanine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl ( 2 - aminoethyl ) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /labels Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /labels Modif ied-site 
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/note= "Guanine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 20 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 

Xaa Xaa Xaa Xaa Lys 
20 



(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modif ied-site 



PM.qnnrar> <wo 943 5477 A2 i > 



WO 94/25477 



PCT/IB94/00142 



- 67 - 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 
CB) LOCATION: 3 

(D) OTHER INFORMATION: /labels Modif ied- site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: / label = Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label= Modif ied-site 
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/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: / label = Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME /KEY ; Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 
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(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 20 
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(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 

N- acetyl (2 -amino ethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa 
20 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/ KEY : Modif ied- site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl ( 2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modified- site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the haterocycle . " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position l of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /labels Modif ied- s ite 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



wsisnncirv <wo 9A25477A2 i > 



WO 94/25477 



PCT/BB94/00142 



- 71 - 

( ix) FEATURE : 

(A) NAME /KEY : Modified- site 

(B) LOCATION; 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label= Modif ied-site 
/note= "Thymine heterocycle is attached to 

N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through uhe N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 11 
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(D) OTHER INFORMATION: /label- Modified- site 

/note- "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label- Modified- site 

/note- "Guanine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Cytosine heteroside is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied- site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
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group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 18 

<D) OTHER INFORMATION : /labels Modif ied-site 

/note* "Thymine heterocycle is attached to 
• N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of thr heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 20 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note* "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 2 
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(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle. 11 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note* "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note- "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label* Modified- site 
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/note= "Cytosine heterocycle is attached to 

N- acetyl (2-aminoethyl ) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Adenine heterocycle is attached to 

N- acetyl (2-aminoethyl) glycine through the N-acetyl 

group at position 9 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /label- Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix ) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

( ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 
(3) LOCATION: 13 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 
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(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION : 15 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyi (2 -amino ethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle," 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 2 0 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 21: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
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15 10 15 

Xaa Xaa Xaa Xaa 
20 

(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modified- site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 - aminoethyl ) glycine through the N-acetyl 
group at position 1 of the heterocycle." 
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(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Guanine heterocycle is attached to 

N- acetyl (2- aminoetiiyl) glycine through the N-acetyl 

group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION : /label* Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /labels Modif ied- site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied- site 

(B) LOCATION: 12 
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(D) OTHER INFORMATION: /label* Modified- site 

/note= "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 14 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 15 

<D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2- aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 16 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 17 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Adenine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied- site 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /label= Modif ied-site 
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/note= "Guanine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /label* Modif ied-site 

/notes "Guanine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 9 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 2 0 

(D) OTHER INFORMATION : /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of thr heterocycle." 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 22 : 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 15 

Xaa Xaa Xaa Xaa Lys 
20 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME / KEY : Modif ied- site 
(3) LOCATION: 1 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note* "Thymine heterocycle is attached tc 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 
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(A) NAME /KEY : Modified- site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label- Modif ied-site 

/notes "Thymine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 
N _ acet yl(2-aminoethyl)giycinr through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 
/note= "Beta isoform of alanine." 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label** Modif ied-site 

/notes "Cytosine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE : 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
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group at position l of the heterocycle . " 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 10 

(D) OTHER INFORMATION : /label= Modif ied-site 

/note= "Thymine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Xaa Xaa Xaa Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /label* . Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /label= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /label* Modif ied-site 

/note= "Thymine heterocycle is attached to 
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N-acetyl ( 2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle . " 

(ix) FEATURE : 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /Iabel= Modif ied- site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /label= Modif ied-site 
/note= "Beta isof orm of alanine . " 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /label- Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 7 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /label* Modif ied- site 

/note* "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /label = Modif ied-site 

/note= "Thymine heterocycle is attached to 
N-acetyl (2-aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME / KEY : Modif ied-site 

(B) LOCATION: 10 
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(D) OTHER INFORMATION : /label= Modif ied-site 

/note- "Thymine heterocycle is attached to 

N- acetyl (2 -aminoethyl) glycine through the N-acetyl 

group at position 1 of the heterocycle." 

(ix) FEATURE: 

(A) NAME /KEY : Modified- site 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /labels Modif ied-site 

/note= "Thymine heterocycle is attached to ^ 
N-acetyl (2 -aminoethyl) glycine through the N-acetyl 
group at position 1 of the heterocycle." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Xaa Xaa Xaa Xaa Ala Xaa Xaa Xaa Xaa Xaa Xaa Lys 
1 5 10 
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WHAT IS CLAIMED IS: 



1 . A compound having the formula : 



L 



L 



2 



Q 




A 




-c 



n 




wherein: 



5 



n is at least 2, 

each of L 1 -!/ 1 is independently selected from the 



group consisting of hydrogen, hydroxy, (C-l-CJ alkanoyl, 
naturally occurring nucleobases, non-naturally occurring 
nucleobases, aromatic moieties, DNA intercalators , 
nucleobase-binding groups, heterocyclic moieties, and 
' 10 reporter ligands, at least one of L 1 -!/ 1 being a naturally 

occurring nucleobase, a non-naturally occurring nucleobase, a 
DNA intercalator, or a nucleobase-binding group; 

each of C x -C a is (CR*R 7 ) y where R 6 is hydrogen and R 7 
is selected from the group consisting of the side chains of 
15 naturally occurring alpha amino acids, or R 6 and R 7 are 

independently selected from the group consisting of hydrogen, 
(C 2 -C 6 ) alkyl, aryl , aralkyl, heteroaryl, hydroxy, (CV 
C 6 )alkoxy, (C^-Cg) alkyl thio, NR 3 R 4 and SR 5 , where R 3 and R 4 are 
as defined above, and R 5 is hydrogen, (C^-Cg) alkyl , hydroxy-, 
20 alkoxy-, or alkylthio- substituted (C^-Cg) alkyl , or R 6 and R 7 
taken together complete an alicyclic or heterocyclic system; 

each of D 1 -D n is (CR 6 R 7 ) Z where R 5 and R 7 are as 
defined above; 

each of y and z is zero or an integer from 1 to 10, 
25 the sum y + z being greater than 2 but not more than 10; 

each of G l -G r " 1 is -NR 3 CO- , -NR 3 CS- , -NR 3 SO- or - 
NR 3 S0 2 -, in either orientation, where R 3 is as defined above; 
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each of A L -A n and B^B* are selected such that: 

(a) A is a group of formula (Ila), (lib) or (lie), 
and B is N or R 3 N + , provided that at least one A is 
a group of formula (lie); or 

(b) A is a group of formula (I Id) and B is CK; or 

(c) A is a group of formula (Ila) or (lib) and B 
is N or R 3 N + , provided at least one of y or z is not 
1 or 2; 



R 1 



(Ila) 





R 1 




F 












— c— 


— Y — 


—I 




_ k 


r 


f 



0 

II 

N C- 



(Ilb) 





R 1 




R 1 


x r3 

II 1 








— C— 






— c— - 


— Y— 




C N 




_ R 2 _ 


r 


_! 2 _ 


s 



10 



where : 



15 



20 



25 



(HO 



(lid) 



X is O, S, Se, NR 3 , CH 2 or C(CH 3 ) 2 ; 

Y is a single bond, 0, S or NR 4 ; 

each of p and q is zero or an integer from 1 
to 5, the sum p+q being not more than 10; 

each of r and s is zero or an integer from 1 
to 5, the sum r+s being not more than 10; 

each R 1 and R 2 is independently selected from 
the group consisting of hydrogen, (Ci-Cjalkyl which 
may be hydroxy- or alkoxy- or alkylthio- 
substituted, hydroxy, alkoxy, alkylthio, amino and 
halogen; and 

each R 3 and R 4 is independently selected from 
the group consisting of hydrogen, (C^-CJ alkyl , hydroxy - 

or alkoxy- or alkylthio-substituted (C^-CJ alkyl , 
hydroxy, alkoxy, alkylthio and amino; 
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Q is -C0 2 H, -CONR'R", -S0 3 H or -S0 2 NR'R'' or an 
activated derivative of -C0 2 H or -S0 3 H; and 

I is -NHR"'R'' # ' or -NR" ' C (0) R" " , where R' , R", 
R' ' ' and R' ' ' ' are independently selected from the group 
5 consisting of hydrogen, alkyl, amino protecting groups, 
reporter ligands, intercalators, chelators, peptides, 
proteins, carbohydrates, lipids, steroids, oligonucleotides 
and soluble and non- soluble polymers. 
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2 . The compound of claim 1 having the formula : 




wherein: 

each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
5 naturally occurring nucleobases, and non-naturally occurring 
nucleobases ; 

each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 
io n is an integer from 1 to 60, 

each k, 1, and m is, independently, zero or an 
integer from 1 to 5; 

each p is zero or 1; 
R h is OH, NH 2 or -NHLysNH 2 ; and 
is R 1 is H or C0CH 3 . 



3. The compound of claim 1 having the formula: 

( C H , ) , 



,(CH 2 K- 



0 

N 
H 



•(CH 2 ) k , 



(CH 2 ) 



N H - R 1 
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wherein : 

each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
naturally occurring nucleobases, and non-naturally occurring 
5 nucleobases; 

each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 

n is an integer from 1 to 60, 
io each k, 1, and m is, independently, zero or an 

integer from 1 to 5; 

each p is zero or 1; 

R h is OK, NK 2 or -NKLysNH 2 ; and 

R 1 is H or COCH 3 . 



is 4 . A compound having one of the following 

formulas : 

L L L 

1 1 

A AA 

SAv^ o, K c /^ c / E or ^ C A D / F 



wherein : 

L is selected from the group consisting of 
hydrogen, hydroxy, (C^-CJ alkanoyl , naturally occurring 

20 nucleobases, non-naturally occurring nucleobases, aromatic 
moieties, DNA intercalators , nucleobase-binding groups, and 
heterocyclic moieties, reporter ligands, wherein amino groups 
are, optionally, protected by amino protecting groups; 

each C is (CR 6 R 7 ) y where R 6 is hydrogen and R 7 is 

25 selected from the group consisting of the side chains of 
naturally occurring alpha amino acids, or R 6 and R 7 are 
independently selected from the group consisting of hydrogen, 
(C 2 -C 6 ) alkyl, aryl , aralkyl, heteroaryl, hydroxy, (C^ 
C 6 )alkoxy, (C^-Cg) alkylthio , NR 3 R 4 and SR 5 , where R 3 and R 4 are 

30 as defined above, and R 5 is hydrogen, (Cj-Cg) alkyl , hydroxy-, 
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lQ 



alkoxy-, or alkylthio- substituted (C^-Cg) alkyl , or R 6 and R 7 
taken together complete an alicyclic or heterocyclic system; 
each D is (CR 6 R 7 ) 2 where R 6 and R 7 are as defined 

above ; 

each of y and z is zero or an integer from 1 to 10, 
the sum y + z being greater than 2 but not more than 10; 
A and B are selected such that : 

(a) A is a group of formula (lie) and B is N or 
R 3 N* ; or 

(b) A is a group of formula (lid) and B is CH; or 

(c) A is a group of formula (Ila) or (lib) and B 
is N or R 3 N + , provided at least one of y or z is not 
1 or 2; 



(Ila) 



(lib) 



-N C— 



R 1 



X V 

II I 

-C N- 



15 



(HO 



(lid) 



where : 



20 



25 



X is O, S, Se, NR 3 , CH 2 or C(CH 3 ) 2 ; 

Y is a single bond, O, S or NR 4 ; 

each of p and q is zero or an integer from 1 
to 5, the sum p+q being not more than 10; 

each of r and s is zero or an integer from 1 
to' 5, the sum r+s being not more than 10; 

each R 1 and R 2 is independently selected from 
the group consisting of hydrogen, (C^-Cjalkyl which 
may be hydroxy- or alkoxy- or alkylthio - 
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substituted, hydroxy, alkoxy, alkylthio, amino and 
halogen; and 

each R 3 and R 4 is independently selected from 
the group consisting of hydrogen, (C^-CJ alkyl, hydroxy- 
5 or alkoxy- or alkyithio-substituted (C^CJ alkyl , 

hydroxy, alkoxy, alkylthio and amino; 

each. E is COOH, CSOH, SOOH, S0 2 OH or an activated or 
protected derivative thereof; and 

each F is NHR 3 or NPgR 3 , where R 3 is as defined 
io above, and Pg is an amino protecting group. 

5. The compound of claim 4 having the formula: 




wherein: 

each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
15 naturally occurring nucleobases, and non-naturally occurring 
nucleobases ; 

each R 7 ' is independently selected from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; and 
20 each k, 1, and m is, independently, zero or' an 

integer from 1 to 5 . 
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6 . The compound of claim 4 having the formula : 




(CH 2 ) 




wherein : 

each L is independently selected from the group 
consisting of hydrogen, phenyl, heterocyclic moieties, 
5 naturally occurring nucleobases, and non-naturally occurring 
nucleobases / 

each R 7 ' is independently selected ■ from the group 
consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids ; and 
io each k, 1, and m is, independently, zero or an 

integer from 1 to 5 . 



7. A process for preparing a compound according to 
claim 1, comprising the steps of: 

A) providing a polymer substrate, said polymer 

15 being f unctionalized with a chemical group capable of forming 
an anchoring linkage with an amino acid; 

B) coupling said polymer with a first amino acid 
through said anchoring linkage, said first amino acid having 
formula (IV) : 

L 




20 (IV) 
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wherein : 

L is selected from the group consisting of 
naturally occurring nucleobases, non-naturally 
occurring nucleobases, aromatic moieties, DNA 
5 intercalators, nucieobase -binding groups, 

heterocyclic moieties, and reporter ligands, 
wherein amino groups are, optionally, protected by 
amino protecting groups ; 

each C is (CR 6 R 7 ) y where R 6 is hydrogen and R 7 is 
io selected from the group consisting of the side chains of 
naturally occurring alpha amino acids, or R 6 and R 7 are 
independently selected from the group consisting of hydrogen, 
(C 2 -C 6 ) alkyi , aryl , aralkyl, heteroaryl, hydroxy, (C x - 
C 6 )alkoxy, (C^-Cg) alkylthio, NR 3 R 4 and SR 5 , where R 3 and R 4 are 
is as defined above, and R 5 is hydrogen, (C^-Cg) alkyl , hydroxy-, 
alkoxy-, or alkylthio- substituted (C^-Cg) alkyl , or R 6 and R 7 
taken together complete an alicyclic or heterocyclic system,* 
each D is (CR e R 7 ) 2 where R s and R 7 are as defined 

above ; 

20 each of y and z is zero or an integer from 1 to 10, 

the sum y +■ z being greater than 2 but not more than 10 ; 
A and B are selected such that : 

(a) A is a group of formula (lie) and B is N or 
R 3 N\- or 

25 (b) A is a group of formula (lid) and B is CH; or 

(c) A is a group of formula (Ila) or (lib) and B 
is N or R 3 N*, provided at least one of y or z is not 
1 or 2 ; 



(Ila) 



(lib) 
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where : 

X is 0, S, Se, NR 3 , CH 2 or C(CH 3 ) 2 ; 
Y is a single bond, 0, S or NR 4 ; 
5 each of p and q is zero or an integer from 1 

to 5, the sum p+q being not more than 10; 

each of r and s is zero or an integer from 1 
to 5, the sum r+s being not more than 10; 

each R 1 and R 2 is independently selected from 
10 the group consisting of hydrogen, (C 1 -C 4 )alkyl which 

may be hydroxy- or alkoxy- or alkylthio - 
substituted, hydroxy, alkoxy, alkylthio, amino and 
halogen; and 

each R 3 and R 4 is independently selected from 
is the group consisting of hydrogen, (C x -C 4 ) alkyl , hydroxy- 

or alkoxy- or alkylthio-substituted (C^-CJ alkyl , 
hydroxy, alkoxy, alkylthio and amino; 

each E is COOH, CSOH, SOOH, S0 2 0H or an activated or 
protected derivative thereof; and 
20 each F is NHR 3 or NPgR 3 , where R 3 is as defined 

above, and Pg is an amino protecting group; 

C) removing said amino protecting group from said 
coupled first amino acid to generate a free amino group; and 

D) reacting said free amino group with a second 
25 amino acid having formula (IV) to form a peptide chain. 



8. The process of claim 7 further comprising the 

steps of : 

E) removing said amino protecting group from said 
second amino acid to generate a terminal free amino group on 
30 said peptide chain; and 
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F) reacting said free amino group on said peptide 
chain with a further amino acid having formula (IV) to 
lengthen said peptide chain. 

9. The process of claim 8 wherein steps E and F 
s are performed a plurality of times. 

10. The process of claim 8 further comprising 
removing at least one protecting group remaining on the amino 
acid moieties of the peptide chain. 

11. The process of claim 7 further comprising 
10 cleaving said anchoring linkage without substantially 

degrading said peptide chain. 

12 . The process of claim 7 wherein the polymer 
substrate contains polystyrene, polyacrylamide, silica, a 
composite material, cotton, or a derivative thereof. 

is 13 . The process of claim 8 wherein the chemical 

group capable of forming said anchoring linkage is chloro-, 
bromo- and iodo- substituted alkyl, amino-substituted alkyl, 
amino and aryl- substituted alkyl, amino- and alkyiaryl- 
substituted alkyl, hydroxy- substituted alkyl, or a derivative 

20 thereof having a spacer group that can be cleaved 

substantially without degradation of said polypeptide. 

14. The process of claim 13 wherein chloro-substi- 
tuted alkyl is chloromethyl , amino-substituted alkyl is 
aminomethyl, amino- and alkyl- substituted aryl is a- 
25 aminobenzyl, amino- and alkylaryl- substituted alkyl is 

selected from the group consisting of a-amino-3- and a-amino- 
4-methylbenzyl, and hydroxy- substituted alkyl is 
hydroxymethyl . 
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15. The process of claim 13 wherein: 

the chemical group is derived from an arnino-containing moiety 
selected from amino-substituted alkyl, amino- and aryl 
substituted alkyl, and amino- and alkylaryl -substituted 
5 alkyl ; and 

the chemical group includes a spacer group derived 
from the group consisting of 4- (haloalkyl) aryl -lower alkanoic 
acids, Boc-aminoacyl-4- (oxymethyl) aryl -lower alkanoic acids, 
N-Boc-p-acylbenzhydrylamines , N-Boc-4' - (lower alkyl) -p- 
10 acylbenzhydryl amines, N-Boc-4 (lower alkoxy) -p- 

acylbenzhydrylamines, and 4-hydroxymethylphenoxy-lower 
alkanoic acids . 

16. A process for sequence-specific recognition of 
a double -stranded polynucleotide, comprising contacting said 

15 polynucleotide with a compound that is different from natural 
RKTA and that binds to one strand of the polynucleotide, 
thereby displacing the other strand, said being the compound 
of claim 1. 

17. A process for modulating the expression of a 
20 gene in an organism, comprising administering to said 

organism a compound according to claim 1 that specifically 
binds to DNA or RNA deriving from said gene, said compound 
being the compound of claim 1. 

18. The process of claim 17 wherein said 

25 modulation includes inhibiting transcription of said gene. 

19 . The process of claim 17 wherein said 
modulation includes inhibiting replication of said gene. 

20 . A process for treating conditions associated 
with undesired protein production in an organism, comprising 
30 contacting said organism with an effective amount of a 

compound according to claim 1 that specifically binds with 
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DNA or ENA deriving from a gene controlling said protein 
Droduction. 



21. A process for inducing degradation of DNA or 
RNA in cells of an organism, comprising administering to said 

5 organism a compound according to claim 1 that specifically 
binds to said DNA or RNA. 

22. A process for killing cells or virus, 
comprising contacting said cells or virus with a compound 
according to claim 1 that specifically binds to a portion of 

io the genome of said cells or virus. 

23. A pharmaceutical composition comprising a" 
compound according to claim 1 and at least one 
pharmaceutically effective carrier, binder, thickener, 
diluent, buffer, preservative, or surface active agent. 
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